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One of the basic properties of a queueing network is stability. 
Roughly speaking, it is the property that the total number of jobs in 
the network remains bounded as a function of time. One of the key 
questions related to the stability issue is how to determine the exact 
conditions under which a given queueing network operating under a 
given scheduling policy remains stable. While there was much initial 
progress in addressing this question, most of the results obtained 
were partial at best and so the complete characterization of stable 
queueing networks is still lacking. 

In this paper, we resolve this open problem, albeit in a somewhat 
unexpected way. We show that characterizing stable queueing net- 
works is an algorithmically undecidable problem for the case of non- 
preemptive static buffer priority scheduling policies and deterministic 
interarrival and service times. Thus, no constructive characterization 
of stable queueing networks operating under this class of policies is 
possible. The result is established for queueing networks with finite 
and infinite buffer sizes and possibly zero service times, although we 
conjecture that it also holds in the case of models with only infinite 
buffers and nonzero service times. Our approach extends an earlier 
related work [Math. Oper. Res. 27 (2002) 272-293] and uses the so- 
called counter machine device as a reduction tool. 

1. Introduction. Queueing networks are ubiquitous tools for modeling 
a large variety of real-life processes, such as communication and data net- 
works, manufacturing processes, call centers, service networks and many 
other real-life systems. It is an important task to design and operate queue- 
ing networks so that their performance is acceptable. One of the key quali- 
tative performance measures is stability. Roughly speaking, a queueing net- 
work is stable if the total expected number of jobs in the network is bounded 
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as a function of time. In a probabilistic framework, which is typically used to 
formalize the stability question, it means that the underlying queue length 
process is positive (Harris) recurrent; see [18, 19, 41]. We do not provide 
a formal definition of this notion here as, throughout the paper, we con- 
sider exclusively deterministic queueing networks, for which stability simply 
means that the total number of jobs in the network remains bounded as a 
function of time. The details of the model description and formal definitions 
of stability are delayed until the next section. 

The research on stability questions started with the works of Kumar and 
Seidman [36], Lu and Kumar [37] and Rybko and Stolyar [45], who, for the 
first time, identified queueing networks and work-conserving scheduling poli- 
cies leading to instability, even though every processing unit was nominally 
underloaded. Namely, the condition ps < 1 was satisfied by every server S. 
Here, ps is the average utilization in server S which is measured roughly as a 
ratio of the total arrival rate into this server to the service rate of this server 
(see the next section). This initiated the search for tight stability conditions. 
Important advances were obtained in this direction, most notably the de- 
velopment of the fluid model methodology, which significantly simplifies the 
stability issue by reducing the underlying stochastic problem to a simpler, 
deterministic continuous-time continuous-state problem; see [19, 48]. It was 
established that the stability of the fluid model implies stability of the un- 
derlying stochastic network [19, 48] and, partially, the converse result holds 
as well [20, 30, 40, 43], although not always; see [14, 21]. Yet, even character- 
izing stability of fluid models turned out to be nontrivial [7, 23, 24, 25] and 
no full characterization is available either. Meanwhile, it was discovered that 
certain classes of networks and scheduling policies are universally stable. For 
example, networks with feedforward (acyclic) structure were proven to be 
stable under an arbitrary work-conserving scheduling policy; see [18, 19, 22]. 
First Buffer First Serve, Last Buffer First Serve static buffer priority type 
scheduling policies were shown to stabilize an arbitrary queueing network 
satisfying a certain topological restriction (a so-called re-entrant line); see 
[25, 35]. A certain simple scheduling policy based on due dates was shown 
to stabilize an arbitrary network [15]. The First-In, First-Out (FIFO) policy 
was proven to be stable in networks where service rates within each server 
are identical — the so-called Kelly-type networks [13]. At the same time, some 
simple static buffer priority policies are not necessarily stable, as was shown 
in the original works on instability; see [36, 37, 45]. Also, FIFO policy can 
lead to instability; see [12, 46]. 

While most of the aforementioned research activity was conducted in the 
operations research, electrical engineering and mathematics communities, in 
parallel and independently, the stability problem was investigated by the the- 
oretical computer science community using the Adversarial Queueing Net- 
work (AQN) model. The motivation there comes from data networks and 
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the models are somewhat different: no probabilistic assumptions are made 
on either the arrival or service processes. Instead, an adversary is assumed 
to inject jobs (communication packets) into the network, which is repre- 
sented as a graph. The links of the graph serve the roles of processing units 
and the processing times are typically assumed to be equal to one unit of 
time deterministically. In this setting, the model is defined to be stable if, 
for every pattern of packet injections, subject to certain load conditions, 
the total number of packets remains bounded as a function of time. The 
AQN was introduced by Borodin et al. [9] and further researched by many 
authors; see [1, 2, 3, 4, 26, 28, 31, 38, 44, 49]. Many results similar to the 
stochastic networks counterpart were established. It was shown that while 
AQN corresponding to an acyclic graph is always stable [9], there are AQN 
and scheduling policies (usually called protocols) which are work-conserving 
(usually called greedy) and which lead to instability; see [1, 31]. It was also 
established that FIFO can lead to instability [1], even with arbitrary small 
injection rates [6]. The relevance of fluid models to AQN was established in 
[26]: stability of the fluid model implies stability of AQN. A partial converse 
result holds, as was also shown in [26]. Yet, despite impressive progress in 
the area and interesting parallel development to the stochastic counterpart, 
tight characterization of stable AQN has still not been achieved. 

In this paper, we frame the problem of characterizing stable queueing 
networks as an algorithmic decision problem: given a queueing network and 
an appropriately defined scheduling policy, determine whether the network 
is stable. In order to introduce the problem formally, we consider the sim- 
plest possible setting: the interarrival times and service times are assumed 
to take deterministic rational values. Throughout the paper, we focus ex- 
clusively on a simple class of scheduling policies, namely the class of non- 
preemptive static buffer priority scheduling policies. We assume that buffers 
have finite or infinite capacity. Jobs which, upon arrival, see a full (finite) 
buffer are dropped from the network. Also, we assume that some of the ser- 
vice times can take zero value. The assumptions of finite buffers and zero 
service times are the only important departures from models studied in the 
stability literature prior to our work. They are adopted for proof tractabil- 
ity, although we conjecture that our main results remain true in the case of 
infinite buffer /nonzero service times case as well. The details of the model 
are given in the following section. 

Our main result is that stability of a queueing network operating under 
a static nonpreemptive buffer priority policy is an undecidable property. 
Thus, no constructive means of characterizing stable queueing networks for 
this class of policies is possible. This resolves the open problem of providing 
tight characterization of stable queueing networks for the class of static non- 
preemptive buffer priority policies. Our work extends an earlier work [27] by 
the first author, were the undecidability result was established for the class 
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of so-called generalized priority scheduling policies. Later, this work was 
extended to the problems of computing stationary distributions and large 
deviations rates [29]. There are important differences between the current 
work and [27]. The class of generalized priority policies was not consid- 
ered in the literature prior to [27]. Additionally, generalized priority policies 
allow idling, whereas most of the work on stability analysis focuses on work- 
conserving scheduling policies. Also, [27] considered the single-server setting, 
whereas, here, we consider the network setting. We note that for the class 
of buffer priority policies (as well as any other work-conserving scheduling 
policies), the question of stability of a single server model is trivially de- 
cidable: one needs to compute the load factor p. The system is then stable 
if and only if p < 1 (p < 1 if all of the interarrival and service times are 
deterministic) . 

The concept of undecidability was introduced in the classical works of 
Alan Turing in the 1930s and it is one of the principal tools for establish- 
ing limitations of certain computational problems. The first problems which 
were established to be undecidable included the Turing halting problem, the 
post correspondence problem and several related problems [47]. Typically, 
one establishes undecidability of a given problem by taking a problem which 
is already known to be undecidable and establishing a reduction from this 
problem to the given problem of interest. This method is well known in the 
computer science literature as the reduction method. Lately, several problems 
were proven to be undecidable in the area of control theory; see [5, 16, 17]. 
In particular, the work of Blondel et al. [5] used a device known as a counter 
machine or counter automata as a reduction tool. In the present paper, as 
in [5], as well as [27], our proof technique is also based on a reduction from a 
counter machine model, although the construction details are substantially 
different from those of [27]. We use a well-known Rybko-Stolyar network [45] 
as a gadget and construct an elaborate queueing network which is able to 
emulate the dynamics of an arbitrary counter machine. The undecidabil- 
ity result is then a simple consequence of the undecidability of the halting 
problem for a counter machine, which is a classical result; see [33]. 

The remainder of the paper is organized as follows. The model descrip- 
tion and the main result are provided in the following section. Background 
material on a counter machine and undecidability is given in Section 3. Sec- 
tion 4 is devoted to constructing a reduction from a counter machine to a 
queueing network. Section 5 is devoted to the proof of the main result. It 
begins with a sketch of the proof, followed by the proof details. In the last 
subsection of this section, we show that while the condition p$ < 1 is not 
satisfied by every server in the network we construct, a simple modification 
achieves this condition. Some concluding thoughts and questions for further 
research are given in Section 6. 
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2. Model description and the main result. 

2.1. Deterministic multiclass queueing networks and a static buffer pri- 
ority scheduling policy. A multiclass queueing network is described as a 
collection of J service nodes, Si,..., Sj, and ./V job classes, 1, 2, . . . , JV. Each 
node is assumed to be single-server type and can process at most one job 
at a time. Each class i is associated with a unique buffer, also denoted by 
i, for convenience. The capacity Bi of the buffer i is finite or infinite and 
denotes the number of jobs which can be stored in the queue of the class i, 
not including the job in service, if any. The queue length corresponding to 
class i is the number of jobs in buffer Bi plus possibly (at most one) job cur- 
rently in service and is denoted by Qi(t). The total queue length Y^ieS^ Qi(t) 
corresponding to the server Sj at time t is denoted by Qsj (t) ■ 

Each class i is associated with an external arrival process Ai(0,t) which 
denotes the total number of jobs which arrived externally to the buffer 
Bi during the time interval [0, t]. The arrival processes typically consid- 
ered in the literature are either random renewal processes (in the stochas- 
tic queueing networks literature) or adversarial processes (in the computer 
science literature). Throughout this paper, we adopt the following simple 
assumption: the intervals between the arrivals of jobs is a deterministic 
class-dependent rational quantity a, and the initial delay is some rational 
bi. Thus, the external arrivals corresponding to the class i occur exactly at 
times nai + b, n = 0, 1, . . . , and Ai(0, t) = \_(t — b) /a,, J + 1. The external arrival 
rate is then Aj = l/a,. Let A = (At, 1 < i < N). Some classes may not have 
an associated external arrival process, in which case Oj = oo (At = 0) and 
Ai(0,t) = for all t > 0. We will also write Ai(t) = 1 if there is an arrival at 
time t (i.e., t = a^n + bi for some n G Z + ) and Ai(t) = otherwise. Each class 
i is associated with a deterministic service time < rrij < oo which takes a 
nonnegative rational value. The service rate is Hi = 1/mj. We allow service 
times to take zero value, namely /ij = oo. This assumption is a departure 
from models considered in prior literature and is adopted for proof tractabil- 
ity. We say that at a given time t, server S is busy only if, at time t, the 
server is working on a job which requires a nonzero remaining service time. 
For every collection of classes V, the associated workload Wv(t) at time t is 
the total time required to serve jobs which are presently in the network and 
which will eventually arrive into classes in V, in the absence of new arrivals. 

The routing of jobs in the network after the service completions is con- 
trolled as follows. A zero-one N x N sub-stochastic matrix R is fixed. 
Namely, the row sums of this matrix add up to at most unity and the 
spectral radius of this matrix is strictly less than unity. For every pair of 
classes i, I such that R41 = 1, every job which completes service in class i at 
some time t is immediately routed to buffer B[ after the service completion. 
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If the buffer is not full, that is, Qi(t) < Bi, then the job is added to the end 
of the queue in the buffer. If the buffer is full, that is, Qi(t) = B[, then the 
job is dropped from the network. The special case B\ = is interpreted as 
follows: a job routed to class I is accepted if and only if the server is idle 
and can begin processing this job immediately. In fact, the network we will 
build in Section 4 will only have B\ = or B\ = oo. If class i is such that 
Ri t i = for all /, then the jobs in class i after the service completion depart 
from the network. Since the routing matrix R has spectral radius less than 
unity, then R m = for some m. Namely, every job leaves the network after 
some finite number of re-routings. The equation A = A + i? T A, also known 
as the traffic equation, then admits a unique solution, explicitly given as 
A = [I — P T ]~ 1 A. Here, i? T denotes the transpose of R. For every server S, 
the quantity ps = YlieS^i/ fa i s defined to be the traffic intensity or load 
factor in server S. 

The selection of jobs for processing is controlled using some scheduling 
policy. In the present paper, we exclusively consider the class of static non- 
preemptive buffer priority scheduling policies. Any such policy ir is described 
as follows. For each server Sj, a permutation 8j of the elements of classes 
belonging to Sj is fixed. At time t = and at every time instance t corre- 
sponding to a service completion in Sj, the server Sj finds the index i 6 Sj 
with the smallest value Oj(i) such that Qi(t) > 0, selects the job in the 
head of this queue and begins working on it. If J2i<=Sj Qii s ) = 0> then the 
server idles until the first time that a job appears in one of the classes 
and starts working on this job. The vector 9 = (9j),l < j < J , then com- 
pletely specifies the scheduling policy it. In particular, the scheduling pol- 
icy is nonpreemptive and nonidling: no service is every interrupted and no 
server idles whenever at least one of the corresponding queues is nonempty. 
Static buffer priority policies have been studied extensively in the literature; 
see [8, 10, 11, 23, 25, 34, 37, 42, 45]. 

A queueing network, described by servers Sj,l < j < J, classes i = 1,2, . . . , 
N, the routing matrix R, inter arrival times aj, delays bi and service times mi 
will be denoted by Q for brevity. The queueing network Q, together with the 
scheduling policy ir and the vector of initial queue lengths (Qi(0), 1 < i < N), 
completely determines the queue length dynamics of the network, namely 
the vector process Q(s) = (Qi(s),s > 0). 

Definition 1. A triplet (Q,ir, Q(0)) is defined to be stable if 

(1) sup ^2 Qi(s) <oo. 

s -° l<i<N 

A queueing network Q together with the scheduling policy ir is defined to 
be stable if (Q,vr, Q(0)) is stable for every Q(0). 
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When all buffers in the network are infinite, the so-called load condition 
p$ < 1 for all servers S is necessary for stability. The presence of finite buffers 
may change the situation, as, for example, the model is trivially stable when 
all of the buffers are finite. Nevertheless, we will see that the load condition 
is satisfied by all servers in the specific queueing models we construct in this 
paper, after appropriate modifications described in Section 5.2. 

In models with probabilistic settings, (Q(s),s > 0) is typically a stochastic 
process, in which case the queueing network is defined to be stable if the pro- 
cess is so-called positive Harris recurrent; see [18, 19, 41]. Under minor addi- 
tional assumptions, this implies the property sup s>0 X)i<i<iv^[ ( 5i( s )] < °°- 
In our deterministic setting, however, this reduces to the simple condition 
(1). The main goal of the stability research is developing methods for deter- 
mining stability of a given triplet (Q,tt,Q(0)) or pair (Q, 7r). In many inter- 
esting special cases, stability of (Q,vr) is implied by stability of (Q,n, Q(0)) 
for a given initial state Q(0). For example, in the stochastic setting, this 
would be the case provided that the underlying Markov chain is irreducible. 
Due to the deterministic nature of our model, though, this implication does 
not necessarily hold and it is important to make the distinction. 

2.2. The main result. The main result of this paper is establishing the 
undecidability (noncomputability) of the stability property for the class of 
buffer priority policies 9. Precisely stated, it is as follows. 

Theorem 1. The property "(Q, 9, Q(0)) is stable" is undecidable. Namely, 
no algorithm can exist which, on every input (Q,6,Q(0)), outputs YES if 
the triplet (Q,0,Q(O)) is stable and outputs NO otherwise, where Q is an ar- 
bitrary multiclass queueing network, 9 is an arbitrary nonpreemptive static 
buffer priority scheduling policy and Q(0) is an arbitrary vector of initial 
queue lengths. 

To prove Theorem 1, we introduce, in Section 3, a device called a counter 
machine and its stability. Stability of a counter machine is a property closely 
related to the so-called halting property, which is a classical undecidable 
property. 

3. Counter machine, the halting problem and undecidability. A counter 
machine (see [5, 33]) is a deterministic computing machine which is a sim- 
plified version of a Turing Machine — a formal description of an algorithm 
performing a certain computational task or solving a certain decision prob- 
lem. In his classical work on the halting problem, Turing showed that certain 
decision problems simply cannot have a corresponding solving algorithm and 
are thus undecidable. For a definition of a Turing Machine and the Turing 
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halting problem, see [47]. Since then, many quite natural problems in math- 
ematics and computer science have been found to be undecidable, Hilbert's 
tenth problem [39] being one of the most notable examples. The famous 
Church-Turing thesis states that every computable property can be com- 
puted by a Turing Machine. Thus, undecidable problems, that is, problems 
for which a Turing Machine cannot be built, are truly problems not allowing 
constructive solutions. 

More recently, several undecidability results were obtained in the area of 
control theory, some of them using a counter machine; see Blondel et al. [5]. 
For a survey of decidability results in the area of control theory, see Blondel 
and Tsitsiklis [17]. We also use the counter machine device as our reduction 
tool and, thus, in the next subsection, we provide a detailed description of 
a counter machine and state relevant undecidability results. 

3.1. Counter machine and the halting problem. A counter machine is 
described by two counters R±,R2 and a finite collection of states S. Each 
counter Ri contains some nonnegative integer z\ in its register. Depending 
on the current state s£S and on whether the content of the registers is 
positive or zero, the counter machine is updated as follows: the current state 
s is updated to a new state s' € S and one of the counters has its number 
in the register incremented by one, decremented by one or no change in the 
counters occurs. 

Formally, a counter machine is a pair (S, V). S = {s\,S2, ■ ■ ■ ,s m } is a fi- 
nite set of states and T is configuration update function T : S x {0,1} 2 — > 
S x {(— 1,0), (0, — 1), (0,0), (1,0), (0, 1)}. A configuration of a counter ma- 
chine is an arbitrary triplet (s, 2:1,2:2) £ S x A configuration (s, 21,22) is 
updated to a configuration (s', 2i,2 2 ) as follows. Let 1{-} be the indicator 
function. Specifically, for every integer 2, 1{2:} = 1 if z > and 1{2:} = 
otherwise. Given the current configuration (5,21,22), suppose, for exam- 
ple, that T(s, l{zi}, 1(22}) = (s',1,0). The current state is then changed 
from s to s' , the content of the first counter is incremented by one and 
the second counter does not change: z[ = z\ + 1, z' 2 = z%. We will also write 
r: (s,2i,2 2 ) — > (s',zi + 1,2 2 ) and r:s —> s',T:zi — > Z\ + 1,T:22 — > Zi- Sup- 
pose, on the other hand, that T(s, l{2i}, 1{22}) = {s\ (—1,0)). The current 
state then becomes s', z[ = z\ — 1, z' 2 = z 2 . Similarly, if T(s, b) = (s' , (0, 1)) or 
T(s,b) = (s' , (0, —1)), then the new configuration becomes (s', 21,22 + 1) or 
(s',2i,2 2 — 1), respectively. If T(s,b) = (s', (0,0)), then the state is updated 
to s' , but the contents of the counters do not change. It is assumed that the 
configuration update function T is consistent, in the sense that it never at- 
tempts to decrement a counter which is equal to zero. The present definition 
of a counter machine can be extended to the one which incorporates more 
than two counters, but such an extension is not necessary for our purposes. 
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Given an initial configuration (s°, zj,^) G 5 X the counter machine 
uniquely determines the subsequent configurations (s , z\ , z\), (s 2 , z\ , z 2 ), ■ ■ ■ , 

(s t ,z[,Z2), We fix a certain configuration (s*, z*, z 2 ) and call it the ZiaZi- 

configuration. If this configuration is reached, then the process halts and 
no additional updates are executed. The following theorem establishes the 
undecidability (also called noncomputability) of the halting property. 

Theorem 2. Given a counter machine (S, T), initial configuration (s , z®, 
z 2 ) and the halting configuration (s*, z*, £|)> the problem of determining 
whether the halting configuration is reached infinite time (the halting prob- 
lem) is undecidable. It remains undecidable even if the initial and the halting 

configurations are the same, with both counters equal to zero: s° = s*,Zi = 
o * * r\ 

z 2 ~ z l ~ z 2 ~ u ■ 

The first part of this theorem is a classical result and can be found in [32]. 
The restricted case of s° = s*,z® = z*,i = 1,2, can be similarly proven by 
extending the set of states and the set of transition rules. It is the restricted 
case of the theorem which will be used in the current paper. 

3.2. Simplified counter machine (SCM), stability and decidability. We 
say that a counter machine is stable if the value of counters is bounded as 
time goes to infinity. Namely, sup 4 z\ < oo and sup t z\ < oo. It is shown in [26] 
that determining whether a counter machine which started in a given config- 
uration (si,0,0) is stable is an undecidable problem, by a simple reduction 
to the halting problem. 

Definition 2. A simplified counter machine (SCM) is a counter ma- 
chine satisfying the following condition: there exist two functions a : S x 
{0, l} 2 -» S, [3 : S -> {-1, 0, l} 2 such that F(s, z u z 2 ) = (a(s, \{z x > 0}, l{z 2 > 
0}),/?(a(s, l{z\ > 0},l{z2 > 0}))). In other words, while the new state s' 
depends on the entire current configuration (s, Z\,Z2), the incrementing or 
decrementing of counters at the next step depends only on the new state s' . 

It turns out that this restrictive version of a counter machine is still suf- 
ficiently general for our purposes. 

Proposition 1. Given a counter machine, an SCM can be constructed 
such that the SCM is stable if and only if the given counter machine is stable. 

Proof. We modify the state space {sj}, 1 < j < m, to {s° dd }i<j< m U 
{(s<3 ven , ^ £>2)}i<7<m,bi,b2e{-i,o,i}- The transition rules are defined as follows: 
a(sf d ,b u b 2 ) =~(sf ci , Ai, A 2 ) 'if and only if r( Sj ,6 1 ,6 2 ) = (s,, Ai, A 2 ), and 
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Fig. 1. Subnetwork SN t . 



/3(sf ven ,A!,A 2 )) = (A X ,A 2 ). Also, a(4 ven , A ls A 2 ) = sf d and P{sf d ) = 
(0,0). It is then not hard to observe that each transition (sj,zi,Z2) — > 
(si,z[,Z2) with b\ = z[ — zi, hi = z' 2 — Z2 is emulated by two transitions in 
theSCM: (sf d , z u z 2 ) ((sf™,h, h), z[, z' 2 ) -+ (sf d ,z[, z' 2 ). □ 

Corollary 1. Determining the stability of SCMs with a given initial 
configuration = 0, z\ = is an undecidable problem. 



4. Description of the queueing network corresponding to an SCM. Given 
an SCM with states {s±, s%, . . . , s m } and counter update rules a, (3, we con- 
struct a certain multiclass queueing network, a static buffer priority policy 
and the vector of queue lengths at time zero. This network, policy and initial 
state combination will have the property that it is stable if and only if the 
underlying SCM is stable, thus the reduction goal will be achieved. 

We now proceed to the details of the construction. The queueing net- 
work consist of three subnetworks denoted, respectively, SNi, SN 2 and MN, 
which stand for subnetwork 1, subnetwork 2 and the main network; see Fig- 
ures 1 and 2. The subnetwork SNi, i = 1, 2, will be in charge of the updates 
of the counter readings Z\. The network MN will be in charge of updating 
the state Sj of the SCM. We will describe the network structure in detail, 
as well as the buffer priority scheduling policy implemented in this queueing 
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Fig. 2. Main network M N . 



network. The policy is henceforth denoted by 9. All of the buffer capacities 
in the network are either zero or infinite. 

The subnetworks SNi, i = 1,2, are identical in their topological descrip- 
tion. They will only differ in their buffer contents. Hence, we only need to 
describe one of these subnetworks. In Figures 1 and 2, the buffers with in- 
finite capacity are marked by a vertical bar and the remaining buffers have 
finite capacity. 

4.1. The description of the subnetwork SNi, i = l,2. The subnetwork 
SNi consists of five servers, Sij, j = 1, . . . , 5; see Figure 1. The classes (buffers) 
corresponding to server Sij are denoted by triplets ijk. Table 1 lists servers, 
classes (buffers), the next classes (if any), the corresponding (deterministic) 
service times, priorities and the buffer capacities. Service times are shown 
in column 4 and only nonzero service times are shown. If, after service com- 
pletion, the jobs from a given class exit the system, then the corresponding 
entry in the next class column is absent. Thus, the unlisted service time 
entries correspond to zero service time. For each class, we also provide the 
next class to where the jobs are routed after service completion. If the cor- 
responding entry is empty, it means that the job leaves the network after 
the service completion. The fifth column corresponds to the priority of this 
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Table 1 
Servers and classes in SNi 



Server 


Classes 


Next class 


Service time 


Priority 


Capacity 




ill 


i21 




2 


00 




ill 




0.5 


1 


00 




zl3 


£31 




3 







ill 


£31 




4 





5', 2 


£21 




0.5 


1 


00 




£22 


£12 




2 


00 




i23 


£31 




3 





Sis 


i31 




0.04 


2 


00 




i32 




1.1 


1 


00 




i33 


Oli of the network MN 




3 





S, 4 


£41 




0.2 


1 


00 




£42 


ill 




2 





Sis 


£51 


ill 


0.02 


1 


00 



class within the server. For example, the order of priority of classes in server 
Sn is H2, ill, H3, il4, meaning that H2 has the highest priority, ill has the 
next highest priority, etc. The collection of classes Hl,il2,i21,i22 is defined 
to be a "Rybko-Stolyar sub-network," or RSSNi. It indeed describes the 
well-known Rybko-Stolyar network; see [18, 45]. The choice of service times 
in the subnetwork SNi, as well as in the network MN described in the fol- 
lowing section, is somewhat arbitrary, except for service times for classes 
H2, i21 being equal to 0.5. The numbers are arranged so that the proof goes 
through and is easy to follow. Yet the choice of service times in il2,i21 is 
explained by making the corresponding Rybko-Stolyar network critical, in 
some appropriate sense. For more details, refer to the beginning of Section 5. 

There are seven external arrival processes into subnetwork SNi, denoted 
by Aj(0,s), j = 1, . . . ,7. The corresponding information is summarized in 
Table 2. For each arrival process, we describe exact arrival times, as well 
as the class to which the arriving job is routed. For example, the entry i42 
corresponding to the arrival process A l 2 indicates that jobs arrive precisely 
at times 0.02, 1.02, 2.02, . . . and are routed to the class i42. The arrival times 
are represented in the form an + b for some explicit constants a, b. Here, a 
is the interarrival time and b is the initial delay. This means that for every 
nonnegative integer n, an arrival occurs at time an + b. 

4.2. The description of the main network MN . The main network con- 
sists of 2m + 2 servers, where to is the number of states in the SCM. The 
servers are Sqi, S02, ^3j> 3 = 1,2,..., to. The table describing servers, 
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Table 2 
Arrival processes into SNi 



Arrival process 


Classes 


Arrival times 




i22 


n 


4 


£42 


Ti + 0.02 


A', 


il3 


3n + 1.6 


4 


i23 


3n + 2.1 


A\ 


;14 


3n + 2.6 


At 


i32 


3n + 1.5 


A\ 


£33 


3n + 2.7 



classes, next classes, service times, priorities and buffer capacities is given 
below as Table 3. The interpretation is the same as for the table for sub- 
networks SNi. Specific attention is paid to classes 4j3, 1 < j < m, and the 
next classes described generically as "i41,i51 or exit." The jobs departing 
from class Aj3 are routed to: 

1. class 141 if (3{j) = (-1,0); 

2. class 151 if (3{j) = (1,0); 

3. class 241 if (3(j) = (0,-1); 

4. class 251 if [3{j) = (0,1); 

5. exit the network if (5(j) = (0,0). 

In Table 3, some classes within the same server are assigned the same prior- 
ity level. This means that the tie is broken arbitrarily. We prefer to assign 
the same priority level for simplicity. In reality, as we will see, the server will 
never have to prioritize between these classes as at most one of the corre- 
sponding buffers will be nonempty. In order to avoid overcomplicating the 
figure, the servers 3j are described separately for classes 3kl, 3k2, 3k3, 3k4 
and classes 3j'5, although these belong to the same group of servers 3j, j = 
1, .. . ,m. Arrivals into the main network are summarized in Table 4. There 
are 3m external arrival processes into subnetwork MN, denoted by Aj(0, s), i = 
3, 4, 5, j = 1, 2, . . . , m. We have started the index i from 3 to avoid confusion 
with arrival processes Aj,Aj in networks SNi, i = 1,2. The corresponding 
information is summarized in Table 4. The arrival times are again repre- 
sented in the form an + b for some explicit constants a, b. 

We now describe the initial state of our queueing network at time s = 0, 
namely Q(0). At this time, there is one job in class 02j in the main network, 
where j is such that Sj = s* is the initial state of the SCM. The service is 
initiated at time s = 0, so the processing of this job will be over at time 2.71. 
All other buffers in the entire queueing network are empty. 
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Table 3 
Servers and classes in MN 



Server 


Classes 






Next classes 


Service time Priority Capacity 


<Sbi 


Oil 








0.09 


1 


CO 




012 








0.18 


2 


CO 




all 03j 






3jl 




3 


CO 




all 02 j 






03j 


2.71 


1 


00 




3kl, for all k such that 


a(sfe,l,l) 


= s j 


3fc2 


0.09 


1 


00 




3fc2, for all k such that 


a(«*,0,l) 


= S J 


3fc3 


0.09 


1 


00 




3fc3, for all k such that 


a(«fc,l,0) 


= *j 


3fc4 


0.09 


1 


00 




3fe4, for all k such that 


a(s k , 0,0) 


= s j 




0.09 


1 


00 




3j5 






4jl 


0.02 


2 







4jl 








0.02 


1 


00 




4j2 






02j 




2 







4j3 






i41,i51 or exit 




3 






5. Proof of Theorem 1. Our main result, Theorem 1, follows immediately 
from Corollary 1 and the following theorem. 

Theorem 3. The queueing network constructed in the previous section 
with the prescribed initial state Q(0) is stable if and only if the SCM is stable. 

Before we provide details of the proof of Theorem 3, let us present the 
overall idea of the proof in the proof sketch below. 

Proof sketch of Theorem 3. We begin with a brief description of 
the Rybko-Stolyar network RSSNi, which is embedded in our subnetwork 
SNi,i = 1,2, in relation to servers Sn,Si2 and classes ill, il2, i21, i22. In- 
stead of two arrival processes feeding class ill in SNi, suppose that we have 

one external arrival process with arrival times t = 0, 1, Namely, arrivals 

occur at the same times as for arrivals into class i22. Suppose, as it is in our 
case, that class H2 has priority over class ill, and class i21 has priority over 
£22. The service times in classes ill, il2, £21, i22 are set to take the same 



Table 4 
Arrival processes into MN 



Arrival process 


Classes 


Arrival times 




3j5 


3n - 0.01 




4j2 


3n 




4j3 


3n 
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values as in our network SN^. Suppose, also, that at time + , we have m 
jobs in class i21 and no jobs elsewhere. It is a simple exercise to check that 
at time m + , there will be m jobs in class H2 and no jobs elsewhere; at time 
(2m) + , there will be m jobs in i21 and no jobs elsewhere; at time (3m) + , 
there will be m jobs in H2 and no jobs elsewhere, etc. Furthermore, it is 
a simple exercise to see that the total number of jobs in the four classes 
Hl,il2,i21,i22 remains the same m at every integer time t + . 

Now, let us go back to our construction. The two Rybko-Stolyar networks 
RSSNi, i = l,2, embedded into SNi,i = 1, 2, will model the two counters in 
the counter machine, in the sense that the value of the counter i = 1,2 will 
correspond to roughly the number of jobs in the classes ill, zl2, z21, z22 at 
times 3t + 1 (to be exact, it will correspond to the workload corresponding 
to these classes; see below). We will arrange the dynamics so that if, during 
the transition t—^t + 1, a counter i has to increment (resp., to decrement, 
to leave unchanged) its value, then the number of jobs in the Rybko-Stolyar 
part of SNi will increase by one (resp., decrease by one, stays the same) over 
the time period [3i + l,3i + 4]. Specifically, say counter i increments its value 
by one during the transition t — > t + 1. We will arrange for exactly one job 
to arrive from MN into class i51 exactly at time 3t + 3. After an additional 
delay of 0.02 in server S^, it will arrive into class ill at time 3t + 3.02. The 
extra delay of 0.02 is created in order to synchronize with arrivals at time 
t + 0.02 (possibly) coming from class i42. The net result is one extra job in 
the Rybko-Stolyar part of SNi added during [3t + 1, 3t + 4]. 

On the other hand, suppose that counter i decrements its value by one 
during the transition t — > t + 1. We will arrange for exactly one job to arrive 
from MN into i41 at time 3t + 3. This job will occupy server S^ during 
(3t + 3, 3t + 3.2) and, as a result, the job arriving into class i42 at time 
3t + 3.02 will be blocked. The net result (compared to the pure Rybko- 
Stolyar network described above) is that one job is lost. 

The case when the counter does not change simply corresponds to no 
jobs arriving into i41 and i51 at time 3£ + 3, implying no change in the total 
number of jobs in the Rybko-Stolyar part of SNi. 

Furthermore, the classes zl3,il4,i23 and classes in the server are 
constructed so that when a job arrives into zero-buffer class i33 at time 
3t + 2.7, it will be processed immediately and sent to MN if the Rybko- 
Stolyar part of SNi is empty at time 3^ + 1 (namely, counter i is empty) and 
will be blocked and dropped from the network at time 3t + 2.7 otherwise. 
Namely, these classes serve as a testing mechanism for checking whether the 
counter i is empty or not at time t. 

Additionally, there is a correspondence between the states of the SCM 
and the MN network. Specifically, we will arrange that if, at time t, the 
state of SCM is q, then, at time 3t, the server £02 will start working on a 
job in class 02q. The dynamics is arranged so that if the state of SCM at 



16 



D. GAMARNIK AND D. KATZ 



time t + 1 is r, then, at time 3t + 3, the server S02 will start working on a job 
in class 02r, thus building the required correspondence between the network 
MN and the state of the SCM. Specifically, this is arranged as follows. The 
job in class 02q will be processed after 2.71 time units and possibly incur a 
delay in server S03. The delay is either zero, 0.09, 0.18 or 0.27, depending 
on whether there are jobs arriving into classes Oil and 012 from SNi,SN2- 
From the description above, there is a job arriving from SNi if and only if 
counter i is empty at time t. Thus, the four possible delays uniquely identify 
which of the counters i = 1,2 are empty and which are not. Next, the job 
will visit four (possibly repeated) servers among S^j, l<j<m, indexed by 
four states, a(q,0,0),a(q,l,0),a(q,0,l),a(q,l,l), which can follow state q 
in the SCM. Depending on the incurred delay, it will be in exactly one of 
these possible servers at time 3(t + 1) — 0.01 when an external job arrives 
into this server and is thus blocked. We arrange that it is precisely server 
Ss r . The blocked job in buffer 3r5 is prevented from arriving into class 4rl 
at the same time 3(t + 1) — 0.01 and allows jobs in classes 4r2 and 4r3 to 
be processed at time 3(t + 1). These will be the only jobs in classes 4j'2 
and 4j'3, j = 1, 2, . . . , m, which are served at time 3(i + 1). One of these 
jobs arrives into class 02r, thus completing the cycle and indicating that the 
new state of the SCM is r, and the other job is sent to either i41 or i51, 
depending on which of the two counters needs to be updated (if any) and 
whether the update is increment or decrement. □ 

For the remainder of the paper, we focus on establishing Theorem 3. 
We first introduce the following definitions. Let Wi(s) be the combined 
workload of the servers Sn,Si2 in the network SN at time s. Namely, 
it is the amount of service required to serve all jobs in servers Sn,Si2 at 
time s when the scheduling policy 9 is implemented. Observe that W%(s) = 
Wii2(s) + Wi2i(s) + 0.5Qi22(s) + 0.5Qiu(s), where W il2 {s) and Wm(s) stand 
for the time required to process jobs currently in buffers il2, z21 (if any), 
respectively. We will specifically focus on workloads Wi(s~), where s~ indi- 
cates the time immediately preceding s. Thus, if there is an arrival at time 
s, this arrival has not shown up at s~. 

For every integer time instance t = 1,2, . . . , we define the status of the 
main network MN to be the following quantity: for every k = 1,2, ... ,m, 
Status m N (t) = k if, at time t — 1, server S02 of the network MN started 
working on a job in class 02k and there are no other jobs anywhere in the 
network MN at time t. Otherwise, Status MN(t) = — 1- 

For each i = 1,2, we also define the status of the subnetwork SNi at 
a given time 3i + 1 for t € Z + as follows. StatussN t (3t + 1) = 2Wi((3t + 
1)~) if Qn2(3t + l)Qi2i(3t + 1) = and there are no jobs anywhere else 
in the subnetwork SNi, other than possibly in the four classes of RSSN 
(namely, classes Hl,il2,i21,i22). Otherwise, Status sNi(3t + l) = —1. We do 
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not define Status sNi(t) at other values of t. As we will see shortly, the status 
functions at time 3t + 1 will represent the configuration of the SCM at time 
t. Provided that we have initialized our queueing network properly, none of 
the status functions will ever take value —1. 

Theorem 4. If the configuration of the SCM after t steps is (s q ,zi,Z2), 
then Status MN(3t + I) = q and Status sNi(3t + 1) = Zi, * = 1,2. 

Proof. The proof is by induction. For t = 0, the statement of Theorem 4 
holds because the queueing network initialization makes it so. The remainder 
of the paper is devoted to proving the induction step. It is given in Section 
5.1. □ 

We now show how this result implies Theorem 3. 

Proof of Theorem 3. The idea of the proof is to show that a bound 
on the value of counters of the SCM implies a bound on the number of jobs 
in the queueing network at any one time, and vice versa. 

Suppose that the SCM is stable. That means that there is a bound M 
on the maximum value of counters so that z\ and z<i never exceed M . Let 
(sj,zi,Z2) be the configuration of the SCM at time t. Then, by Theorem 4, 
at time (3t + 1)~ , there are z\ < M jobs in SN±, Z2 < M jobs in SN2 and one 
job in the main network. So, at time (3t + 1)~, there can be no more than 
2M + 1 jobs in the queueing network. Since there is only a constant number 
of arrival processes in the network and the arrival process is deterministic, 
for every time period [3t + l,3(t + 1) + 1), the total number of jobs in the 
network is bounded by 2M + C for some constant C which depends only 
on the network parameters. Thus, if the SCM is stable, so is the queueing 
network. 

Conversely, suppose that the network is stable and that, at any time t, 
the total number of jobs in the network does not exceed M for some finite 
value M. Then, M is also an upper bound on Status sN t (3t + 1) for every 
t. By Theorem 4, this implies that the values z\,Z2 of the counters of the 
SCM are bounded by M and therefore the SCM is also stable. □ 

5.1. Proof of the induction step of Theorem 4- This subsection proves 
the induction step of Theorem 4. Thus, we assume that its statement holds 
after t steps and prove that it holds after t + 1 steps. Assume that the 
configuration of the SCM at time t is (s g ,zi,Z2); StatusMN(3t + 1) = q, 
StatussNi(3t + 1) = Zi, i = 1,2. Assume that the configuration of SCM at 
time t + 1 is T(s q , z\,Z2) = (s r ,yi,y2)- We need to show that StatusMN(3t + 
4) = r, Status sNi(3t + 4) = y it * = 1,2. 
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5.1.1. Dynamics in subnetwork SNi. 

Lemma 1. For every time s>0, either Qn2(s) = or Qi2i(s) = 0. More- 
over, 4zWi{s) = —1 whenever Wi(s) > and s € M+ is not an instance of 
arrivals into servers Si\ , S%2 • 

Remark. The first part of the lemma is a well-known fact from the 
stability literature, stating that the classes il2, i21 constitute a virtual server 
such that only one of the two classes can be served at any given time; see 
[21, 24]. 

Proof of Lemma 1. Suppose that the statement of the lemma does 
not hold. Then, let u = inf(s : Qa2(s) > and Qi2i(s) > 0). That means that 
both buffers H2 and i21 are nonempty at time u + , but at least one of the 
two is empty at time vT . Suppose that this holds for buffer £12. This implies 
that there was an (instantaneous) service completion in buffer i22 at time u. 
Class i2l has higher priority than class i22 (consult Table 1). This implies 
that the server S2 was not working on the job in class i21 at time u~ . 
Since, however, class i21 is nonempty at time u + , we conclude that there 
was an arrival into buffer i21 at exactly time u. We conclude that there was 
a simultaneous arrival into buffers H2 and i21 at time u and buffers H2 and 
i21 were empty at time u~ . 

We now show that such a thing is impossible. Since jobs arrive to H2 
from i22 and into i22 from outside at integer times n, we see that u must 
take integer values. We now obtain a contradiction. The jobs arrive into ill 
only from classes £42 and £51. Jobs arriving into £42 arrive from outside 
at noninteger times n + 0.02. Buffer £42 has no capacity and the processing 
time for this class is zero. Therefore, these jobs can ultimately arrive into i21 
only at times n + 0.02 and not at integer times. Jobs arriving into £51 have 
a nonzero processing time 0.02. These jobs arrive from the main network 
MN from classes 4j'3 which correspond to zero capacity buffers and zero 
processing times. Jobs arrive into 4j'3 from outside at integer times 3n. 
Thus, these jobs can ultimately arrive into class i21 only at times 3n + 0.02 
and not at integer times. We conclude that jobs cannot ever arrive into i21 
at integer times. 

Similarly, we consider the case where Qa\{u~) = 0. Since Qi2i{u + ) > 0, 
there was a service completion in buffer ill at time u. We already showed 
above that this can only occur at times of the form n + 0.02. Also, this 
means that Qi2(u~) =0 since class H2 has higher priority than class £11. 
Thus, there was an arrival into £12 at time u, namely there was a service 
completion in £22 at time u. Since Q2i{u~) = and the service time in £22 
is zero, there was an arrival into £22 at u. But these arrivals only occur at 
integer times n. Again, we obtain a contradiction. 
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To establish the last part regarding ^Wj(s), observe that only jobs in 
buffers il2, i21 have nonzero processing times. Since only one of these buffers 
can contain a job, the case Wj(s) > corresponds to the case of exactly one 
of these buffers having jobs as, otherwise, if both H2,i21 are empty, then 
the remaining jobs in servers Sn,Si2 are processed immediately since they 
have zero service time requirement. The assertion then follows. □ 

Lemma 2. There are no arrivals into buffers i41,i51 during the time 
interval [St + 1, 3* + 3) . 

Proof. Arrivals into classes i41 and i51 can happen as a result of a 
departure from one of the classes 4j3 of the network MN . The buffers 4j'3 
have zero capacity and zero processing time. Therefore, service completions 
happen there simultaneously with arrivals from arrival processes Aj. How- 
ever, those arrivals occur only at times 3t. Thus, the first arrival after 3t can 
occur only at time 3t + 3. The assertion then follows. □ 



Lemma 3. During the time interval [3t + 1,3* + 3), exactly one of the 
servers Sn and Si2 is busy and Wi((3t + 2)~) > Wi((3t + l) - ). In addition, 
during this time period, jobs in classes H2 and i21 finish service only at 
times which are multiples of 0.5. 

Proof. By Lemma 1, at most one of servers Sn,Si2 does work at any 
given time. Thus, we need to show that at least one server works during this 
time period. 

By Lemma 2, there are no arrivals into buffers z41, i51 during [3t + 1, 3t + 
3). By the inductive assumption, StatussNi(3t + 1) = z% > 0, implying, in 
particular, that there are no jobs in buffer z41 at time 3t + 1. Thus, buffer 
z41 is empty during [3* + 1,3* + 3). This means that the jobs arriving into 
class i42 at times 3* + 1.02 and 3* + 2.02 will arrive instantly into buffer ill. 
Also, one job will arrive into i22 at time 3t + 1, 3t + 2. By Lemma 1, only one 
of the jobs in buffers H2,i21 can be served at a time. Thus, the dynamics of 
the number of jobs in the subnetwork RSSNi can be viewed as dynamics of 
a single server queue with service time 0.5 and arrivals at times 3t + 1, 3i + 
1.02, 3t + 2, 3t + 2.02. It is then easy then to explicitly construct Wi(s) during 
the time period s € [3t + 1, 3t + 3), given the initial value Wi((3t + 1)~), and 
the graph of Wi(s) is depicted in Figures 3-5. The part [3t + 1,3* + 3) is 
identical in all three figures. The differing parts of the graph corresponding 
to the interval [3i + 3, 3i + 4) will be used later in Section 5.1.2. In particular, 
we see that if Wj((3i+ 1)~) > 0, then Wi(s) is always positive during the time 
interval [3t + 1,3* + 3) and if Wj((3* + 1)~) = 0, then Wi(s) is equal to zero 
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Wi(s) 



Wi{(M + l)-) 



N 




3t + 1.02 



3« + 2.02 



3t + 1.02 



■o — o 



3t+ 1 



3i + 2 



3i + 3 



3t + 4 



Fig. 3. Workload Wi(s): case 1. 



only at time s = 3t + 2. In particular, at least one (and therefore exactly one) 
of the servers Sn,Si2 was busy during the time interval [3i + 1, 3t + 3). We 
also see, by inspection, that Wi((3t + 2)~) > Wi((3i + 1)~). Finally, by the 
inductive assumption, Status sNi(3t + 1) = Z{ = 2Wj((3t + 1)~); in particular, 
it is an integer. This means that there is no service in progress in buffers 
il2,i21 at time 3t + 1. Thus, whether or not there are prior jobs in buffers 
zl2,i21 at time 3t + 1, there will be service completions exactly at times 
3i + 1.5, 3t + 2, 3i + 2.5 and 3t + 3, as seen by again inspecting Figures 3-5. 
This proves the second assertion of the lemma. □ 

Lemma 4. Suppose that Status SNi(3t + l) > 1. Then, the job J arriving 
at time 3t + 2.7 from outside according the arrival process A\ will be routed 
to buffer OH of the network MN at time 3t + 2.7. 

Proof. At time 3t + 1.5, a job arrives into class i32 which requires 1.1 
units of processing time. Since i32 is the highest priority class in server 
this server will be busy until time 3t + 2.6. Also, this class having the highest 
priority implies that there is only one job of this class at a time. Thus, at 
time 3t + 2.6, buffer i32 is empty. Buffer i31 has the second highest priority 
and buffer i33, to where the job J arrives, has the lowest priority. Thus, 
whether J will be blocked from service at arrival time 3t + 2.7 depends on 
the number of jobs in buffer i3\ at time 3t + 2.7. The processing time for 
these jobs is 0.04. Therefore, J will not be blocked if and only if there are 
at most two jobs in i3\ since, then, these jobs will be processed not later 
than 3t + 2.6 + 0.04 + 0.04 < 3t + 2.7 and, otherwise, they will be processed 
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at time 3t + 2.6 + 0.04 + 0.04 + 0.04 > 3t + 2.7. We conclude that J will be 
blocked if and only if there are at most two jobs in buffer i31. We now show 
that this is indeed the case provided StatussNi(3t + 1) > 1. 

Jobs arriving into buffer i31 depart from classes H3, HA and i23. These 
buffers have zero capacity and zero service time. Therefore, they can arrive 
into i31 only at a time of arrival into these three buffers, namely at times 
3t + 1.6, 3t + 2.1 and 3t + 2.6. In particular, there will be up to three jobs in 
buffer i31 at time 3t + 2.6. Thus, we need to show that it is impossible for 
all of these three jobs to arrive into i31. We will show that at least one of 
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these jobs is blocked. By Lemma 3, either server Sn or Si2 is busy during 
[3t + l,3i + 3). Suppose that the job arriving into il3 at time 3t + 1.6 is 
not blocked. This means that S& is busy at time 3t + 1.6. By Lemma 3, 
it will remain busy until 3t + 2. If it remains busy after this time, then it 
will remain busy until 3t + 2.5, the job arriving into i23 at time 3t + 2.1 is 
blocked and the assertion is established. Thus, the only remaining possibility 
is that Si2 finishes service at time 3t + 2 and remains idle after this. We will 
show that a job arriving into il4 at time 3t + 2.6 will then be blocked and 
the proof is then complete. By Lemma 3, Wj((3t + 2) _ ) > Wi((3i + 1) _ ) > 1. 
Thus, there is at least one job in either Sn or i21 at time (3t + 2)~ which 
still requires 0.5 units of processing time. We claim that at time (3t + 2) + , 
it is in il2. Indeed, it cannot be in H2 since the server is idle at this time. 
For the same reason, it cannot be in i22 since service time in this buffer is 
zero. Also, it cannot be in ill since Sn was idle at (3t + 2)~ and the arrivals 
into ill do not occur at integer times. We conclude that there is at least 
one job in il2 at time (3t + 2) + and no jobs in ill,i21,i22 at this time. At 
time 3t + 2, there is an arrival into i22 which then immediately proceeds to 
il2. Thus, we have at least two jobs in il2 at time (3t + 2) + . The server will 
work on them during [3t + 2, 3t + 3) and will block a job arriving into il4 at 
time 3t + 2.6. This completes the proof. □ 

Lemma 5. Suppose that StatussNi{3t + 1) = 0. A job J arriving at time 
3t + 2.7 from outside will then, according to the arrival process A\, exit the 
system immediately. 

Proof. The proof is very similar to the proof of the previous lemma. 
We need to show that all three jobs arriving into classes il3, i23 and il4 
at times 3t + 1.6, 3t + 2.1 and 3t + 2.6, respectively, will not be blocked and 
will be in buffer i31 at time 3t + 2.6. Suppose that StatussNi(3t + 1) = 0, 
that is, Wi((3t + 1)~) = 0. The job arriving at time 3t + 1 into buffer i22 
according to A\ will then immediately proceed to buffer il2 and occupy 
server Sn during the time interval (3t + l,3t + 1.5). By Lemma 2, the job 
arriving into buffer i24 at time 3t + 1.02 according to A\ will be processed 
immediately in buffer i42 and proceed to buffer ill. It will be delayed in 
buffer ill until 3t + 1.5 and, at this time, will depart to buffer i21 and 
occupy server Si2 during the time interval (3t + 1.5, 3t + 2). Then, again, a 
job arriving at 3i + 2 into i22 will proceed into il2 and occupy the server Sn 
during the time interval (3t + 2,3t + 2.5). Finally, the job arriving into i42 
at time 3t + 2.02 will be delayed in ill until 3t + 2.5 and will then occupy 
Si2 during (3t + 2.5, 3t + 3). It is clear from this dynamics that all of the 
three jobs arriving at times 3t + 1.6,3t + 2.1 and 3i + 2.6 into buffers il3, i23 
and il4 will be processed immediately and arrive into buffer i31 at the same 
times, 3t + 1.6, 3t + 2.1 and 3t + 2.6. □ 
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Combining the results of Lemmas 4 and 5, we obtain the following con- 
clusion. 

Corollary 2. If StatussNi(3t + 1) > 1, then exactly one job arrives 
into the class Oli of network MN at time 3t + 2.7. If Status sNi(3t + 1) = 0, 
then no job arrives into Oli at time 3t + 2.7. 

5.1.2. Dynamics in MN . We now switch to the analysis of the dynamics 
in network MN. Recall that, by the inductive assumption Status mn (3t + 
1) = q, we have one job in class 02q at time 3t + 1, which started service 
at time 3t, and there are no other jobs in MN at time 3t + 1. We call 
this unique job /C. Recall that the configuration (q,x±,X2) of the SCM at 
time t is assumed to be updated to the configuration (r, 2/1,2/2) at time 
t + 1. Introduce mi = a(s q , 1, 1), 771,2 = ct(s q , 0, 1), 7713 = a(s q , 1, 0) and m4 = 
a(sq, 0,0). Namely, mi, ma, 777,3,7714 are the four possible values of the state 
r. 

Lemma 6. During the time interval (3t + 2.98, 3t + 3.07), the job K, will 
be in server 3r, buffer 37774 (resp., buffer 37773 or 3rri2 or 3m\) if and only 
if x\ = X2 = (resp., if and only if x\ = 1, xi = or x\ = 0, X2 = 1 or xi = 
X2 = 0). This job will leave the network before time 3t + 0.34. 

Proof. By the inductive assumption, the job K, will finish service in 
buffer 02(7 at time 3t + 2.71 and will arrive into buffer §3q. It will possibly 
experience a delay in the corresponding server S01 which depends on the 
presence/absence of jobs in buffers 011,012. We now consider four possible 
cases: 

1. Case x\ = X2 = 0. By the inductive assumption, this means that 
Status 5JVi(3t + 1) = Status sN 2 (3t + 1) = 0. By Corollary 2, this means 
that at time 3t + 2.7, no jobs arrive into buffers 011, 012. Since only jobs 
arriving from buffer i33, that is, ultimately from A l 7 , can possibly get 
into buffers 011, 012, these buffers are empty until at least 3(t + 1) + 2.7. 
In particular, the job fC arriving into 03q at time 3t + 2.71 will find an 
idle server and will proceed immediately to buffers 3mi, 3m2, 37713 and 
37774. In each of these buffers, it has the highest priority. Since the service 
time in each of these buffers is 0.09, it will arrive into these four buffers at 
exactly the times 3t + 2. 71, 3t + 2.8, 3t + 2.89 and 3t + 2.98. In particular, 
it will be in buffer 37774 during the time interval (3t + 2.98, 3t + 3.07) and 
the assertion is established. 

2. Case x\ = 1,X2 = 0. By the inductive assumption, this means that 
Status sn-l (3t + 1) > 0, Status sn 2 (3£ + 1) = 0. By Corollary 2, this means 
that at time 3t + 2.7, no job arrives into buffer 012 and one job ar- 
rives into buffer 011. This job has the highest priority and requires 0.09 
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units of processing time. The only difference with the previous case, 
then, is that the job K. now experiences a delay of 0.09 in server Sol- 
Thus, it will arrive into buffers mi, m.2, and 1714 at exactly the times 
3t + 2.8, 3t + 2.89, 3t + 2.98 and 3t + 3.07. In particular, it will be in the 
buffer 3rri3 during the time interval (3t + 2.98, 3t + 3.07) and the assertion 
is thus established. 

3. Case x\ = 0, X2 = 1. The analysis is similar. We observe that we will have 
one job in buffer 012 and no jobs in buffer 011 at time 3t + 2.7. This 
buffer 012 has the second highest priority; the job /C will experience a 
delay of 0.18, the processing time of a job in buffer 012. 

4. Case x\ = X2 = 1. The analysis is similar. In this case, we have one job in 
buffer 011 and one job in buffer 012. The job /C is delayed by 0.18 + 0.09 = 
0.27 time units. 

Finally, we again see, by considering the four cases, that the job K, will 
depart from the network at time 3t + 3.34, at the latest. This completes the 
proof of the lemma. □ 

Lemma 7. At time (3t + 3) _ , the server S^ r is idle and the servers 
S&j, j 7^ r, are busy processing jobs in buffers 4jl. 

Proof. At time (3t + 3)~ , the servers S&j can be busy only serving jobs 
in buffer 4jl. These jobs arrive from zero capacity buffer 3j5. These jobs 
have the highest priority in server S&j and the second highest in S^j- Also, 
these jobs arrive at time 3(t + 1) — 0.01 into 3j5. The only way for these jobs 
to be dropped from zero capacity buffer 3j5 is by a higher priority buffer in 
these servers (i.e., one possibly serving job /C) being occupied. By Lemma 6, 
this is the case for exactly one server, namely server 3r. □ 

Lemma 8. Status mjv(3£ + 4) = r. 

Proof. We need to show that at time 3t + 4, in network MN, there 
is one job in class 02r which initiated service at time 3t + 3 and no jobs 
elsewhere. By Lemma 6, the job K, will leave the network before time 3t + 
3.34 < 3t + 4. The jobs arriving into zero capacity buffers 4j'2,4j3, j ^ r, 
at time 3t + 3 will find, by Lemma 7, a busy server 4j and will be dropped 
from the network. The job arriving into buffer 4r3 at time 3t + 3 will find, 
by Lemma 7, an idle buffer and will immediately proceed to one of the 
subnetworks SNi. The jobs arriving into buffers 3j5 at time 3t + 3 — 0.01 
will either be dropped from the network or will proceed to buffers 4jT and, 
after an additional service time 0.02, will leave the network. Thus, they will 
leave the network before time 3t + 3 + 0.01 < 3t + 4. We conclude that only 
the job arriving into buffer 4r2 at time 3t + 3 can remain in the network. 
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By Lemma 7, it will find an idle server 5*4 r and will proceed immediately to 
buffer 02r and begin service there at time 34 + 3. This completes the proof. 
□ 

Lemma 9. There are no arrivals into classes i41, i51 during the time 
period [34 + 1, 3t + 4], other than, possibly, at time 3t + 3. At time 3t + 3, at 
most one job arrives into the four classes 141, 151, 241 and 251. Specifically: 

1. Ai4i(3* + 3) = 1 z//3(s r ) = (-l,0); 

2. A 15 i(3* + 3) = 1 tf/3(s r ) = (l,0); 

3. ^ 24 i(3t + 3) = l i//3(a r ) = (0,-l); 

4. A 251 (34 + 3) = l z//3(s r ) = (0,l); 

5. no arrivals if 0(s r ) = (0,0). 

Proof. Arrivals into i42 and i52 can occur only from buffers 4j3. These 
buffers have zero capacity and zero processing times. The arrivals into these 

buffers occur at times 3n, n = 0, 1, By Lemma 7, only server 4r will 

process a job at time 3t + 3 in buffer 4r3. According to Table 3 and the 
corresponding description, it will be routed to one of the buffers z41, i51 or 
will leave the network precisely as described by the lemma. □ 

Lemma 10. The following hold for i = 1,2: 

1. Statusi(3t + 4) = Statusi(3t + 1) if A m (3t + 3) = A i51 (3t + 3) = 0; 

2. Statusi(3t + 4) = Status i(3t + 1) - 1 if A i41 (3t + 3) = 1; 

3. Status i(3t + A) = Status i(3t + 1) + 1 if A i51 (3t + 3) = 1 . 

Proof. By Lemma 1, we have Qin{3t + 4)Q i2 i(3t + A) = 0. Let us show 
that at time 3t + 4, there are no jobs in SNi, other than, possibly, RSSNi. 
By the inductive assumption, we have StatussNi(3t + 1) > 0. In particular, 
at this time, there are no jobs in SNi outside of RSSNi. We need to show 
that no jobs arriving during (3i + 1, 3t + 4] can be outside of RSSNi at time 
3i + 4. 

By Lemma 9, jobs can arrive into i41,i51 during (3t + 1,34 + 4] only 
at time 34 + 3 and only one such job can arrive. Upon arrival, they will 
experience service time of either 0.2 in z41 or 0.02 in buffer i51 and they 
will thus leave the network by time 3t + 3.2, at the latest. 

The jobs arriving into i42 at times 3t + 2, 34 + 3, 34 + 4 will either be 
dropped or proceed to buffer ill, which is a part of RSSNi. Thus, at time 
34 + 4, these jobs will either be in RSSNi or will leave the network (no jobs 
in RSSNt feed buffers outside of RSSNi). 

We have already analyzed the dynamics of the jobs which arrived into 
buffers i 13, H4, i23, i32 and z33 at times 34 + 1.6, 34 + 2.1 and 34 + 2.6 as 
part of the proofs of Lemmas 4 and 5. In particular, we saw that these jobs 
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leave SNi before time 3t + 2.72. We have established that there are no jobs 
in SNi outside of RSSNi at time 3t + 4. 

It remains to analyze the value of Statusi at time 3t + 4. We consider the 
corresponding three cases: 

1. Aj4i(3£ + 3) = Ai§i(3t + 3) = 0. By Lemma 9, there were no arrivals into 
classes i41, i51 in time interval [3i + l,3i + 4]. Consider the quantity 
Wi(s) during this time interval. As long as Wi(s) > 0, by Lemma 1, 
■^Wi(s) = — 1 at time instances s not corresponding to the arrival in- 
stances. However, we have arrivals into i22 at times St + 1, 3t + 2 and 
3t + 3, and into i42 at times St + 1 + 0.02, St + 2 + 0.02 and St + 3 + 0.02, 
ensuring that W,(s) is not for any period of positive length during 
[St + 1, 3t + 4); see Figure 3. In this situation, Wi(s), over the time inter- 
val [3t + 1, 3t + 4) , increases by 3 units due to 6 arrivals, and decreases by 
3 units due to 6 service completions. Thus, Wj((3i + 4)~) = Wj((3t + 1)~). 

2. y4j5i(3t + 3) = 1. The job arriving into i51 at time 3t + 3 after a delay 
of 0.02 will arrive into ill, thus increasing Wi(s) by 0.5 at time s = 
3t + 3.02; see Figure 4. Therefore, W;((3i + 4)") = Wi((St + 1)") + 0.5 
and Status sNi{3t + 4) = Status sNi(3t + 1) + 1. 

3- ^4i4i(3t + 3) = 1. The job arriving into i41 at time 3t + 3 will occupy 
server for 0.2 time units. As a result, the job arriving into z42 at time 
3t + 3.02 will find a busy server and will be dropped from the network. 
Comparing this situation with the case ^4j4i(3t + 3) = ^4j5i(3t + 3) = and 
consulting Figure 5, we obtain the same situation, except that there are 
no arrivals into ill at time 3t + 3.02. The net result is that VF((3t + 4)~) = 
W((3t + 1)") - 0.5 and Status sN^t + 4) = Status SNi (3t + 1) - 1. 

This completes the proof. □ 

As an immediate corollary of Lemmas 9 and 10, we obtain the following. 

Corollary 3. Statusi(3t + 4) = yi and Status2(3t + 4) = y 2 - 

Lemma 8 and Corollary 3 prove the induction step for Theorem 4, so its 
proof is now complete. 

5.2. Load factors. We will establish below that for some servers in the 
queueing network constructed in Section 4, the corresponding load factors 
are greater than unity. As we saw from the proof of our main result, since 
some of the buffers in our network are finite, overloading some of the servers 
does not necessarily lead to instability. Yet, this is a significant departure 
from the standard assumption ps < 1 in most of the literature on stability. 
The goal of this section is to show that simple modifications of our network 
lead to the same, or a very similar, dynamics, while ensuring the ps < 1 
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condition. Thus, our undecidability result extends to networks with the ps < 
1 condition satisfied by all servers. 

We now compute the load factors ps for each server S encountered in our 
constructed queueing network and construct appropriate modifications. We 
begin with the subnetwork SNi. Let us compute the load factors ps iV i = 
1,2, j = 1,2, ... ,5, of the five servers in SNi. The only class in server Sn 
with nonzero service time (equal to 0.5) is class il2. The arrival rate Aji2 
into this class equals the external arrival rate into class i22, namely Ai22 = 1- 
Thus, ps iL = 0.5 < 1 and no modification is needed. 

Now, consider server S^. The only class in this server with nonzero service 
time, equal to 0.5, is class i21. The total arrival rate into this class is Aj2i = 
Ai42 + S j ^4j3 ? where A4J3 is the external arrival rate into class 4j3 in the 
main network MN and the sum is over all j such that class 4j'3 sends 
jobs into class i51. By construction, Aj42 = 1 and A473 = 1/3. Thus, ps i2 < 
(1 + Zi/3)(0.5), where l\ is the total number of such classes. As a result, 
this server is possibly overloaded. We now modify our network as follows. In 
front of the class i51, which is fed by jobs from MN, we create a new server 
with l\ + 2 classes. The first l\ of the classes correspond to arrivals from 
MN which were originally routed into i51. The service rate of these jobs 
is zero, the buffer size is also zero and, upon service completion, the jobs 
leave the network. The (m + l)st class has external arrivals at exactly the 
times 3t (which are arrival times for classes 4j'3) and service time 0.03. This 
class has zero buffer and, upon service completion, jobs leave the network. 
Finally, the class m + 2 has arrivals at times 3i + 0.01, service times 0.01, 
zero buffer and, upon service completion, jobs are routed into the buffer of 
the class ill. The first l\ classes have the higher priority than class l\ + 1, 
which, in turn, has higher priority than the class l\ + 2. The load factor of 
the new server is (l/3)(0.03 + 0.01) < 1. Now, let us see how the new server 
changes the dynamics in the original network. If there is at least one job 
arriving into classes 1, . . . ,l\ in this new server (and we know that only one 
can arrive at a time), then, since this can only happen at times 3t, the job 
arriving into class l\ + 1 is blocked and is dropped from the network. As a 
result, the job arriving into l\ + 2 at time 3t + 0.01 is not blocked and is 
routed into ill at time 3t + 0.02. On the other hand, if no jobs arrive in 
classes 1, ...,fi at time 3t, then the job arriving into l± + 1 at time 3t is 
worked on during the time interval [3t, 3t + 0.03] and blocks the job arriving 
into l\ + 2 at time 3t + 0.01, the latter job being dropped from the network. 
The net effect is the same as when compared with the earlier model: there 
is one job arriving into ill at time 3t + 0.02 if and only if there is one job 
arriving into this class in the original network. But, now, the load factor p$ i2 
of the server S i2 is (1 + l/3)(0.5) < 1. 

Now, consider server S^. We check, in a straightforward way, that ps a = 
3(l/3)(0.04) + (1/3)(1.1) < 1. 
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Considering server Sa, we see that its load factor, p$ iA = Z2(l/3)(0.2), 
may be bigger than unity, where I2 is the total number of classes 4j3 which 
may send jobs from MN to class i41. Our modification of the network is 
very simple: replace the service time 0.2 in i41 by 0.2/m, make arrivals of A\ 
at times t, instead of t + 0.02, and make service times at z42 equal to 0.02. 
This makes the load factor of Sa at most (l/3)m(0.2/m) + 0.02 < 1. The 
net effect is the same: if there is an arrival from MN into i41, this arrival 
can occur only at times 3t and only one job can arrive at a time. This job 
occupies the server during [3t,3t + 0.2/m] and blocks any job arriving into 
i42 according to A\ at time 3t. The latter job is then dropped. If, however, 
no job arrives into z41 at time 3i, then the job arriving into i42 at time 3t is 
processed and, at time 3t + 0.02, it reaches ill, as in the original network. 

For server S^, our earlier modification, namely a new server in front of 
class z51, implies that the new load factor is only ps i5 = (1/3) (0.02) < 1. 

We now turn to the main network MN. Let us compute the load fac- 
tors pSonPSoaiPSzjiPSAjA <j<m, of the servers in MN. We have p S(n = 
(l/3)(0.09) + (1/3)(0.18) < 1 (the two arrival rates 1/3 are for jobs arriving 
from subnetworks SN±, SN2, corresponding to classes 133, 233). As for server 
S02, we have ps 02 = m(l/3)(2.71) and this server is possibly overloaded as 
well. We simply replace this server with m identical servers, each dedicated 
to serving class 02 j, j = 1, . . . , m. Recall that the only function of the server 
S02 was to introduce a fixed delay of 2.71. Each one of the new m servers 
has load factor (1/3) (2.71) < 1. 

Now, let us consider servers S^j. We have ps 3j = h (1/3) (0.09), where I3 
is the number of classes 03j in server Sox which can send jobs into server 
S^j ■ Note that this is also the number of states which can transition into the 
state j in the SCM. Note that Z3 can be as large as 4m. Thus, this server can 
be overloaded. Our modification is as follows. Instead of each server S^j, we 
create 4m servers Ssj s , s = 1, . . . , 4m. Jobs arriving into classes 3kl in server 
Ssj in the original network instead go through servers S$ji, . . . ,S S ju m \, in 
this order, with service requirement 0.09/(4m) in each server. Jobs arriving 
according to into class 3j5 in the modified version have to go through all 
of the 4m servers Ssji, . . . ,S3jU m \, with zero service time requirement and 
zero buffer, and are ultimately routed into class 4jl, as was the case in the 
original network. It is easy to see that we obtain the same net effect: pro- 
cessing one job in class 3kl for 0.09 time units is replaced by 4m subsequent 
processing stages, each with processing time 0.09/ (4m). The load factor in 
each new server is at most (4m)(0.09)/(4m) < 1. 

Finally, observe that p Sij = (1/3) (0.02) < 1. 

This completes the description of the modified network in which the con- 
dition ps < 1 is satisfied by every server S. 
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6. Conclusion. We have established that there does not exist an algo- 
rithm for determining stability of a multiclass queueing network operating 
under a static nonpreemptive buffer priority scheduling policy. Namely, the 
underlying problem is undecidable. There are, however, special cases for 
which the stability can be determined. Characterization of those special 
cases is of interest. Also of interest is whether our undecidability result 
holds for FIFO scheduling policy, another frequently studied scheduling pol- 
icy. Our model incorporated several simplifying assumptions which depart 
from standard assumptions in the literature on stability of queueing net- 
works. Specifically, we considered networks with possibly finite buffers and 
zero service times. We have little doubt that the stability property remains 
undecidable, even for multiclass queueing networks, without these assump- 
tions, but, at present, we do not have a proof of this. 
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